Feature selection for binary classification based on class labeling, SOM, and hierarchical clustering
نویسندگان
چکیده
Feature selection plays an important role in algorithms for processing high-dimensional data. Traditional pattern classification and information theory methods are widely applied to feature methods. However, traditional such as Fisher Score, Laplacian relief use class labels inadequately. Previous based MIFS ignore the intra-class tight inter-class sparse property of samples. To address these problems, a algorithm binary problem is proposed, which on label transformation using self-organizing mapping neural network (SOM) cohesive hierarchical clustering. The first converts without numerical meaning into values that can participate operations retain through mapping, constitutes two-dimensional vector from it attribute be judged. Then, vectors clustered by SOM Finally, evaluation function value calculated, closely related intra-cluster tightness, inter-cluster separation, division accuracy after clustering, used evaluate ability alternative attributes distinguish between classes. It experimentally verified robust effectively screen with strong improve prediction performance classifier.
منابع مشابه
Optimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملOptimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملA Feature Selection Methods Based on Concept Extraction and SOM Text Clustering Analysis
The feature selection is an important part in automatic classification. In this paper, we use the HowNet to extract the concept attributes from word to build a feature set. However, as the concept defi4nition sometimes is too weak in expression, we set a shielded level in the sememe Tree and filter the concept attributes which can not give enough information for classification, and reserve the ...
متن کاملA Feature Selection Method Based on Concept Extraction and SOM Text Clustering Analysis
The feature selection is an important part in automatic classification. In this paper, we use the HowNet to extract the concept attributes from word to build a feature set. However, as the concept defi4nition sometimes is too weak in expression, we set a shielded level in the sememe Tree and filter the concept attributes which can not give enough information for classification, and reserve the ...
متن کاملFeature Subset Selection for Multi-class SVM Based Image Classification
Multi-class image classification can benefit much from feature subset selection. This paper extends an error bound of binary SVMs to a feature subset selection criterion for the multi-class SVMs. By minimizing this criterion, the scale factors assigned to each feature in a kernel function are optimized to identify the important features. This minimization problem can be efficiently solved by gr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Measurement & Control
سال: 2023
ISSN: ['2051-8730', '0020-2940']
DOI: https://doi.org/10.1177/00202940231173748